Scoring the Forecaster by Mean Resulting Payoff of a Distribution of Decision Problems
نویسنده
چکیده
We present a new way to understand and characterize the choice of scoring, objective , or loss function for a model, machine, or expert estimating (i.e. predicting) the probabilities of events or outcomes (such as class membership). The ultimate value of a probability estimate lies in the actual payoo (utility) accruing to those who use this information to make a decision. We allow that we often cannot specify with certainty that the estimate will be used in a particular decision problem, characterized by a particular decision-outcome payoo matrix (cost schedule), and thus by a particular decision threshold. Instead, we consider the more general case of a distribution over such matrices. The proposed scoring function is the expectation, with respect to this distribution, of the payoo that will actually be received. Square-error loss (or Brier score) and log-likelihood (or various entropic measures) arise from speciic examples of such distributions, and even common single-threshold measures such as the misclassiication rate obtain from degenerate special cases. Our scoring functions are always \honesty-rewarding," or maximized in expectation when the estimate equals the true outcome probability. There is still no single \universal" choice of scoring function appropriate in all cases, but speciic choices correspond to speciic assumptions about the probability that the predictions will be thresholded in various ways when used to make a decision.
منابع مشابه
Reduced Variance Payoff Estimation in Adversarial Bandit Problems
A natural way to compare learning methods in nonstationary environments is to compare their regret. In this paper we consider the regret of algorithms in adversarial multi-armed bandit problems. We propose several methods to improve the performance of the baseline exponentially weighted average forecaster by changing the payoff-estimation methods. We argue that improved performance can be achie...
متن کاملA simple approach to multiple attribute decision making using loss functions
Multiple attribute decision making (MADM) methods are very much essential in all fields of engineering, management and other areas where limited alternatives exist and the decision maker has to select the best alternative. Different methods are available in the literature to tackle the MADM problems. The MADM problems are classified as scoring methods, compromising methods and concordance metho...
متن کاملIncentive-Compatible Forecasting Competitions
We consider the design of forecasting competitions in which multiple forecasters make predictions about one or more independent events and compete for a single prize. We have two objectives: (1) to award the prize to the most accurate forecaster, and (2) to incentivize forecasters to report truthfully, so that forecasts are informative and forecasters need not spend any cognitive effort strateg...
متن کاملStrictly Proper Scoring Rules, Prediction, and Estimation
Scoring rules assess the quality of probabilistic forecasts, by assigning a numerical score based on the forecast and on the event or value that materializes. A scoring rule is strictly proper if the forecaster maximizes the expected score for an observation drawn from the distribution F if she issues the probabilistic forecast F , rather than any G 6= F . In prediction problems, strictly prope...
متن کاملTwo Stage Multiple Attribute Decision Making Problem in Iranian Gas Distribution Systems
The purpose of this paper is to present the possibility of replacing physical unit cost in transportation or distribution problems by an aggregate coefficient, getting qualitative and subjective considerations involved. The model for constructing aggregate cost is a two stage multiple attribute decision-making problems. In the first stage supply points, demand points and routes of transportatio...
متن کامل